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Objectives: The purpose of this study was to design a prediction model that explains the characteristics of elderly adults at 
risk of malnutrition. Methods: Data were obtained from a large data set, 2008 Korean Elderly Survey, in which the data of 
15,146 subjects were entered. With nutritional status a target variable, the input variables included the demographic and 
socioeconomic status of participants. The data were analyzed by using the SPSS Clementine 12.0 program's feature selection 
node to select meaningful variables. Results: Among the C5.0, C&R Tree, QUEST, and CHAID models, the highest predict- 
ability was reported by C&R Tree with the accuracy rate of 77.1%. The presence of more than two comorbidities, living alone 
status, having severe difficulty in daily activities, and lower perceived economic status were identified as risk factors of mal- 
nutrition in elderly. Conclusions: A reliable decision support model was designed to provide accurate information regarding 
the characteristics of elderly individuals with malnutrition. The findings demonstrated the good feasibiUty of data mining 
when used for a large community data set and its value in assisting health professionals and local decision makers to come up 
with effective strategies for achieving public health goals. 
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I. Introduction 

Data mining is a recently developed technological method- 
ology that has been used intensively and extensively in many 
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fields. It is defined as the process of discovering previously 
unknown patterns or trends from stored data, building pre- 
dictive models based on that information. Compared with 
the past healthcare system, there has been an ever increas- 
ing amount of data generated by current healthcare settings. 
The amount of data generated in current healthcare settings 
is becoming too large and complex to be analyzed by tradi- 
tional statistical methods. Due to its good applicability to 
any field of study, data mining has gradually gained popular- 
ity in almost all areas of healthcare [1]. However, data min- 
ing has been used mostly to analyze disease-focused clinical 
data rather than community data gathered for primary care 
providers in Korea. The importance of accessing community 
health status has been widely recognized throughout the na- 
tion and data mining can empower local decision makers 
with a clear methodology for organizing and interpreting 
healthcare data [2] . 
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Data mining has been recognized as a solution to handle 
these large amounts of complex data, transforming these 
data into meaningful information for the decision making 
process of health professionals. A comprehensive communi- 
ty health profile plays an important role in the development, 
refining and monitoring of long-term strategies designed to 
improve the overall health status of the population across the 
nation. Such information obtained at the community level 
can be used to ensure that the best resource allocation choic- 
es are made to improve community health [3]. Along with 
Korean Elderly Survey, a number of national representative 
studies, such as the Korean Longitudinal Study of Ageing 
and the Korean Retirement and Income Study, have recently 
been conducted in Korea [4,5]. Currently, more healthcare 
data are available at the community level, enabling health 
professionals to make a transition from the previous disease- 
and treatment-focused strategy to a health- and prevention- 
focused strategy. This also clarifies the public health role, 
which should be supported by unbiased data that accurately 
reflect a community's health status, needs, and resource al- 
location [6]. 

The proportion of elderly population in Korea reached 
1 1.8% in 2012 which is expected to increase further account- 
ing for 24.3% in 2030 and 37.4% by 2050 [7]. The growing 
elderly population has a huge impact on the healthcare sys- 
tem, and malnutrition is a common health problems found 
frequently among the elderly [8]. The leading causes of mal- 
nutrition include several individual factors, such as poor oral 
health, loss of vision and hearing, dementia, impaired mobil- 
ity, and pain. Malnutrition is both a cause and consequence 
of many geriatric diseases that accounts for a significant pro- 
portion of national medical spending [9]. Nutritional status 
has a great influence on the immune system of the human 
body; the lower immune response induced by malnutrition 
eventually puts elderly individuals at a high risk of infection, 
increasing the risk of mortality [10]. A rapid and precise di- 
agnostic method that supports clinical decision enables the 
earlier identification of the elderly at risk of malnutrition. 
This will eventually prevent negative outcomes caused by 
poor nutritional status, resulting in a substantial reduction 
in healthcare cost spent on the elderly population [11]. Most 
studies to date have conducted research on small samples 
obtained from local hospitals or specific communities using 
traditional statistical analysis, which may not be sufficient 
for discovering comprehensive knowledge [12]. In addition, 
the limited number of variables analyzed in those studies 
could not provide integrative understanding of elderly mal- 
nutrition. 

Data mining is known to be the best method in terms of the 
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cost estimation, disease diagnosis, prediction of prognosis, 
and the discovery of hidden patterns in the healthcare system 
with the use of large health database [13]. A decision tree 
model presented in this study was developed from a combina- 
tion of decision tree approaches and statistical analysis. Data 
mining provides information that can be the used in the anal- 
ysis of risk factors for certain types of diseases. Contrasting the 
characteristics of ill patients with those of healthy people to 
find the patterns related to the occurrence of disease has been 
a common practice in evidence-based medicine [ 14] . The pur- 
poses of the study were to identify the best modeling method 
among data mining tools by testing the discriminatory power 
of individual models and to identify the patterns related to 
the occurrence of malnutrition in elderly. Finding patterns 
for the related characteristics of risky conditions that threaten 
the health of the community may contribute to the achieve- 
ment of public health goals with effective disease prevention 
strategies. By identifying the associated characteristics of older 
adults who are more likely to be malnourished in community 
settings, the decision tree model allows health professionals 
and decision makers to intervene earlier to allocate adequate 
resource for nutritional support, improving the overall health 
of the community. 

II. Methods 

1. Subjects 

A secondary analysis of a large community data set from 
the 2008 national study of Korean elderly was conducted to 
provide accurate information regarding the characteristics of 
the elderly at risk of malnutrition in Korea. The Institutional 
Review Board gave prior approval to the study. The sample 
consisted of 15,146 adults over 60 years old and community 
dwelling. The ten items of the 'Determine Your Nutritional 
Health Checklist' were used to access dietary intake and 
meal patterns. The tool was developed by the Nutritional 
Screening Initiative to screen nutritional risk in the elderly 
population. A summed score of 0-2 indicates good nutri- 
tional status, 3-5 moderate risks, and 6 or more high risk 
[ 15] . In this study, elderly people with summed scores of 3 or 
more were classified as being at risk of malnutrition as they 
pose problems for malnutrition. 

2. Decision Tree Models 

Figure 1 presents the data mining process that started with 
the selection of variables. The target variable was malnutri- 
tion, and the 52 input variables covered all aspects of elderly 
malnutrition, including general characteristics, family & social 
relationship, economic status, health status, health behavior. 
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The Korean elderly 
survey data in 2008 



Assessment of model performance 



Select the best model 



Find out the rules 



Selection of variables 




- Target: malnutrition 

- Input: 52 variables 






- Using partition node 
(training data: test data = 7:3) 

- Using feature selection node (base the 
p-value for categorical prediction on pearson) 


Data processing 







Application of tree-structured models 




- C5.0/CHAID/QUEST/C&R tree 





■ Using analysis node 
(percentage of correct & wrong) 



Figure 1. Process of data mining. 



After feature selection 



I Initial input variables | 



■ General char acteristics 

- Sex 
-Age 

- Education 

- Religion 

- Marital status 

- Family type 

■ Family & Social 
relationship 

- Living children 

- Siblings 

- Relative 

- Friends & neighbor 

■ Economic status 

- Currently having job 

- Monthly income 

- Subjective economy state 

• Health status 

- Health condition 

- The number of handicap 

- Diagnosed disease 

- Daily activity difficulty by 
disease 

- The number of pain site 

- Daily activity difficulty by pain 

- Medication 

- Falling down 

- Depress status 

■ Health behavior 

- Current smoking 

- Ever smoking 
-Alcohol drinking 

- Nutritive condition 



- Low physical activity 

- Moderate physical activity 

- Hard physical activity 

- Total sleeping hour 

- Sleep quality 

- Influenza vaccination 

- Medical examination 

- Hospitalization 

■ Functional status 

- MMSE grade 

- Eyesight 

- Daily activity difficulty by 
eyesight 

- Hearing 

- Daily activity difficulty by 
hearing 

- Chewing 

- Swallowing difficulty 

- BMI obesity level 

■ Leisure & social activity 

- Social activities 

- Lifelong education 

- Volunteer work 

* Aging & quality of life 

- Against old age 

- Life satisfaction 
-Abuse experience 

• Living environment 

- Poor level of surroundings 

- Convenience of house 

- Transportation 

- Difficulty on going out 



1 . Health condition 

2. Age 

3. Eucation 

4. Depress status 

5. Daily activity difficulty 
by disease 

6. Life satisfaction 

7. Daily activity difficulty 
by pain 

8. Hearing 

9. Difficulty on going out 

10. The number of handicap 

11. MMSE grads 

12. Transportation 

13. Currently having job 

14. Chew/ing 

15. Nutritive condition 

1 6. The number of pain site 

17. Eyesight 

18. Social activities 

19. Diagnosed disease 

20. Marital stats 

21. Daily activity difficulty by 
hearing 

22. Low physical activity 

23. Daily activity difficulty by 
eyesight 

24. Sleep quality 

25. Falling down 

V 



26. Family type 

27. Alcohol drinking 

28. Convenience of house 

29. Sex 

30. Living children 

31. Against old age 

32. Sibllings 

33. Swallowing difficuity 

34. Subjective economy 
state 

35. Monthly income 

36. Hospitalization 

37. Moderate physical 
activity 

38. Medication 

39. Abuse experience 

40. Medical examination 

41. Volunteer work 

42. BMI obesity level 

43. Poor level of 
surroundings 

44. Relative 

45. Friends & neighbor 

46. Current smoking 

47. Lifelong education 

48. Religion 

49. Ever smoking 



Final input variables 



1. Chewing 

2. Family type 

3. Depress status 

4. Life satisfaction 

5. Daily activity difficulty by pain 

6. Dagnosed diseases 

7. Subjective ecomony state 



Figure 2. Input variables and feature selection. 



functional status, leisure & social activity, quality of life, and 
living environment (Figure 2). The ratio between the train- 
ing data and test data was 7:3 by random division of the full 
data set to which four tree -structured models, including C5.0, 
CHAID, QUEST, and classification and regression (C&R) 
Tree, were applied. In an assessment of model performance, 
analysis node was used to calculate the percentage of correct 
and wrong classification of each model, and then the correct 
classification rates of the models were compared to select 



the best model. In this study, the C&R Tree showed the best 
predictive performance, and it was selected to draw the final 
decision support tree [16]. A decision tree is a visual repre- 
sentation of obtained knowledge using a tree form in which 
each branch represents an output of the test. This decision tree 
enables easy understanding and interpretation of data as the 
nodes and branches are organized hierarchically. It generates 
reliable knowledge, promoting accuracy in clinical decision- 
making [17]. The C&R Tree node is a tree-based classification 
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and prediction method. The C&R Tree approach provides a 
very simple representation that displays accumulated knowl- 
edge well. This method allows the tree to grow large before 
pruning, which results in smaller trees, and this ensures bet- 
ter cross-validation properties of decision tree modeUng [18]. 
The recursive partitioning process splits the records into seg- 
ments with similar output field values, and two subgroups are 
defined by the first split. The split ends when one of stopping 
criteria is triggered [19]. 
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3. Statistical Analysis 

The default experimental parameters of Clementine ver. 12.0 
(SPSS Inc., Chicago, IL, USA) were utilized to construct 
the decision tree model with the usage of the C&R Tree 
component in this study. SPSS ver. 12.0 for Windows (SPSS 
Inc.) was used to perform the statistical analysis in which 
important variables were chosen using the feature selection 
method. Among Pearson, Likelihood ratio, Cramer's V, and 
Lambda- a, Pearson correlation analysis was selected to pri- 
oritize the meaningful variables [20] . 



Table 1. General characteristics 



Variable Total (n = 15,146) 


Normal (n = 7,769} 


Malnutrition (n = 7,377) 


t 


p- value 


Sex 
Male 
Female 


6,185 (40.8) 
8,961 (59.2) 


3,581 (57.9) 
4,188 (46.7) 


2,604 (42.1) 
4,773 (53.3) 


182.49 


0.001 


Age (yr) 
60-69 
70-79 
>80 


7,090 (46.8) 
6,192 (40.8) 
1,864 (12.4) 


4,278 (60.3) 
2,790 (45.1) 
701 (37.6) 


2,812 (39.7) 
3,402 (54.9) 
1,163 (62.4) 


558.26 


0.001 


Education (yr) 
0-6 
7-12 
>12 


5,908 (39.0) 
3,646 (24.1) 
837 (5.5) 


3,137 (53.1) 
2,284 (62.6) 
584 (69.8) 


2,771 (46.9) 
1,362 (37.4) 
253 (30.2) 


746.08 


0.001 


Marital status 
Single 
Married 
Separated 


54 (0.4) 
9,380 (61.9) 
5,712 (37.7) 


14 (25.9) 
5,770 (61.5) 
1,985 (34.8) 


40 (74.1) 
3,610 (38.5) 
3,727 (65.2) 


1,031.72 


0.001 


Family type 
Living alone 
Living with spouse 
Living with children 


3,484 ^23.0; 
6,653 (43.9) 
4,256 (28.1) 


925 (26.5) 
2,432 (57.1) 


2,559 (73.5) 
1,824 (42.9) 


1,132.29 


0.001 


Monthly income (1,000 US $) 
<0.5 
0.5-0.9 

1- 1.9 

2- 2.9 
>3 


4,229 (27.9) 
3,877 (25.6) 
3,501 (23.1) 
1,685 (11.1) 
1,854 (12.2) 


1,356 (32.1) 
1,908 (49.2) 
2,153 (61.5) 
1,100 (65.3) 
1,252 (67.5) 


2,873 (67.9) 
1,969 (50.8) 
1,348 (38.5) 
585 (34.7) 
602 (32.5) 


1,106.11 


0.001 


Employment 
Yes 
No 


5,204 (34.4) 
9,942 (65.6) 


3,138 (60.3) 
4,631 (46.6) 


2,066 (39.7) 
5,311 (53.4) 


257.36 


0.001 


Religion 
Buddhist 
Protestantism 
Catholic 
No religion 


5,085 (33.6) 
2,905 (19.2) 
1,144 (7.6) 
5,872 (38.8) 


2,648 (52.1) 
1,481 (51.0) 
566 (49.5) 
3,010 (51.3) 


2,437 (47.9) 
1,424 (49.0) 
578 (50.5) 
2,862 (48.7) 


5.19 


0.392 


Values are presented as number (%). 
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III. Results 

1. Demographic Characteristics of the Subjects 

A total of 15,146 elderly subjects were included in this study, 
and the sample consisted of 6,185 males (40.8%) and 8,961 
females (59.2%). Of the total participants, 7,769 (51.3%) had 
normal nutritional status while 7,377 (48.7%) had poor nutri- 
tional status. Compared with elderly subjects showing good 
nutritional status, the mean age of elderly subjects at risk of 
malnutrition was higher (Table 1). About three-fourths of 
older people living alone were categorized as having poor nu- 
tritional status, and there was a greater proportion of elderly 
subjects with good nutritional status in the higher monthly 
income group. Apart from 54 participants (0.4%), the major- 
ity of participants had been married at least once. While mal- 
nourished elderly subjects comprised about one third of the 
married group, the proportion of malnourished elderly sub- 
jects was much greater in the separated or living alone groups, 
accounting for 73.5% and 65.2%, respectively. 

2. Selecting Important Attributes 

As a result of the feature selection process, 48 important at- 
tributes were identified to be important variables of a total of 
52 input variables shown in Figure 2. Seven variables includ- 
ing chewing abihty, level of hfe satisfaction, depression sta- 
tus, health status, number of diagnosed diseases, difficulties 
in daily activities caused by pain, subjective economic state, 
and monthly income were identified to be most significant 
variables. 

3. Comparing Predictability of the Models using Analysis 
Node 

The highest percentage of correct prediction in training data 
was reported in C5.0; however, the C&R Tree model showed 
relatively higher percentages of correct prediction in both 
training and test data with 78.10% and 80.95%, respectively 
(Table 2). 

4. Modeling of the Data using CEtR Tree 

Figure 2 presents seven finalized input variables that were 
selected during feature selection. The final decision support 
model was designed using the C&R Tree algorithm and is 
shown in Figure 3. 

In this study, a total of six rules were identified to be as- 
sociated with the risk of malnutrition in the elderly. The six 
decision rules were the following: 1) good and very good 
chewing abihty, depressed, level of life satisfaction less than 
2.89; 2) good and very good chewing ability, depressed, level 
of hfe satisfaction above 2.89, severe difficulty in daily activ- 



ity caused by pain; 3) low chewing ability, living with spouse 
or children; low chewing ability, living with spouse or chil- 
dren, depressed; 5) low chewing ability, living with spouse 
or children, not depressed, number of disease above 2.5; 6) 
low chewing ability, living with spouse or children, not de- 
pressed, number of disease less than 2.5, subjective income 
status is poor. 

5. Variables in the Final Modeling 

The traditional statistical method and chi- square test were 
used to find out whether there are significant differences 
between the elderly at risk of malnutrition and normal sub- 
jects in the seven final variables identified by the C&R Tree 
algorithm. All seven variables showed statistical significance 
with ap-value of <0.05 (Table 3). Chewing ability, in particu- 
lar, showed the biggest differences between well -nourished 
participants and malnourished participants. The majority of 
participants with low to moderate chewing ability had poor 
nutrition status (84.1% and 70.2%, respectively) while eight 
out of ten participants with either good or very good chew- 
ing ability were well-nourished. In addition higher propor- 
tions of individuals with depression, low life satisfaction, 
many difficulties in daily activities and poorly perceived eco- 
nomic status were found to be at risk of malnutrition. 

IV. Discussion 

This study explored the applicability of a prediction model 
using data mining of a large community data set. To improve 
the accuracy of knowledge regarding the characteristics of 
the elderly at risk of malnutrition, we identified significant 
rules during the process of data mining. Elderly subjects 
at risk of malnutrition were compared to normal and well- 
nourished elderly subjects to find the patterns associated 
with the occurrence of malnutrition in the elderly popula- 
tion. A tree-structured decision model was designed, and 
its potential application to large amounts of public health 
data was examined. Malnutrition is a commonly reported 
condition among the elderly, and it is considered to be both 
a cause and consequence of many age-related diseases. There 
has been growing concern over poor nutrition status in 
Korea as we are becoming an aging population. Therefore, 
this study was carried out to find patterns related to mal- 
nutrition in the elderly. Using the C&R Tree model, a well- 
designed prediction model was developed, which showed 
good performance in finding associated rules [18]. The C&R 
Tree uses a recursive partitioning method that provides a 
very simple representation that displays accumulated knowl- 
edge with a well-organized structure. To predict continuous 
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Table 2. Predictive performance according to modelling methods 







Training data 


Test data 




Modeling method 


Correct (%] 


Wrong (%) 


Correct (%) 


Wrong (%) 


C5.0 


85.68 


14.32 


80.68 


19.32 


C&R Tree 


78.10 


21.90 


80.95 


19.05 


QUEST 


77.25 


22.75 


78.34 


21.66 


CHAID 


78.59 


21.41 


79.29 


20.71 



Nutritive condition 





node 0 




range 


% 


n 


IVIalnutrition 


48.652 


4,186 


Normal 


54.348 


4,418 


Total 


100.000 


8,604 



Good; very good 

_L 



Chewing ability 

\ 



1 

Low; moderate 





node 1 




range 


% 


n 


Malnutrition 


22.046 


875 


Normal 


77.954 


3,094 


Total 


100.000 


3,969 





node 2 




range 


% 


n 


Malnutrition 


71.435 


3,311 


Normal 


28.565 


1,324 


Total 


100.000 


4,635 



I 

Depression 



Depression state 
\ 



1 

Normal 



I 

Living alone 



Family type 
\ 



1 

Other types 



node 3 
range <x, n 


node 4 
range o/^ ^ 


node 5 
range "/^ ^ 


node 6 
range o/o n 


Malnutrition 50.140 357 
Normal 49.860 355 


Malnutrition 15.904 518 
Normal 84.096 2,739 


Malnutrition 92.055 1,205 
Normal 7.945 104 


Malnutrition 63.319 2,106 
Normal 36.681 1,220 



T 

Life satisfaction 

^ 

= 2.890 



1 

> 2.890 



I 

Depression 



Depression state 

^ 

Normal 



node 7 
range % n 


node 8 
range o/„ „ 


Malnutrition 71.053 108 
Normal 28.947 44 


Malnutrition 44.464 249 
Normal 55.536 311 


Total 100.000 152 


Total 100.000 560 



node 9 
range ^ 


node 10 
range o/o n 


Malnutrition 81.826 968 
Normal 18.174 215 


Malnutrition 53.103 1,138 
Normal 46.897 1,005 


Total 100.000 1,183 


Total 100.000 2,143 



I 

Moderate: none 



Difficulty in daily activity caused by pain 

\ , 



Severe 



Number of disease 
, I 



: 2.500 



I 

> 2.500 



node 11 
range % n 


node 12 
range o/^ 


Malnutrition 39.266 139 
Normal 60.734 215 


Malnutrition 53.398 110 
Normal 46.602 96 


Total 100.000 354 


Total 100.000 206 



I 

Rich; moderate 



Subjective ecomony status 

\ 



Poor 



node 13 
range o/o n 


node 14 
range % „ 


Malnutrition 44.554 675 
Normal 55.446 840 


Malnutrition 73.726 463 
Normal 26.274 165 


Total 100.000 1,515 


Total 100.000 628 



node 15 
range o/^ 


node 16 
range o/^ ^ 


Malnutrition 39.695 363 
Normal 62.305 600 


Malnutrition 56.522 312 
Normal 43.478 240 


Total 100.000 963 


Total 100.000 552 



Figure 3. Decision tree model based on CftR Tree. 



Vol. 20 • No. 1 • January 2014 



www.e-hir.org 35 



Myonghwa Park et al 



Healthcare Informatics Research J^IR 



dependent variables (regression) and categorical predictor 
variables (classification), the C&R Tree builds a classification 
and regression tree [15]. Compared with other classification 
technique used for classification or regression of problems, 
there are many benefits that can be obtained using the C&R 
Tree [21]. The simphcity of the C&R Tree enables clinicians 
to make rapid classification of new clinical observations. 
Moreover, when there is a little prior knowledge, tree meth- 
ods are known to be well suited for data mining tasks for 
data from healthcare settings [22]. 

In this study, seven variables and six rules for the predic- 
tion of malnutrition risk in elderly subjects were identified. 
Those associated rules provide information regarding co- 
occurrence and relationships between risk factors that may 
increase the probability of elderly suffering from malnutri- 
tion. According to the rules identified by the decision tree 
model, elderly people living alone with low chewing ability, 
depression, low hfe satisfaction, comorbidity, functional lim- 
itation and subjective economic issues are more likely to be 



malnourished. Findings are consistent with previous studies 
conducted to identify patient factors associated with the risk 
of malnutrition that was strongly associated with age-related 
changes of older adults. Elderly people with swallowing dif- 
ficulties due to oral problems, such as absence of teeth, poor 
oral hygiene, and gum infections, have a decreased food 
intake and are vulnerable to malnutrition [23]. In addition, 
several studies found that along with functional changes, 
social changes in elderly people's lives (loss, changed family 
structure, and decreased income) may result in depression 
and reduced life satisfaction, increasing the risk of being 
malnourished [24] . 

The final results of the tree method demonstrated the us- 
ability of large pubUc health data sets with good feasibility of 
decision model in the classification of elderly with malnutri- 
tion. Through repeated testing and refinement of data min- 
ing and the C&R Tree in particular, it is anticipated that new 
knowledge will be discovered by more sophisticated analysis 
of healthcare data at the community level [13]. As a result of 



Table 3. Statistical analysis to variables used final modeling 



Variable Total (n = 15,146) Normal (n = 7,769) Malnutrition (n = 7,377) or f p-value 



Chewing ability 
Low 

Moderate 
Good 
Very good 


390 (2.6) 
7,770 (51.3) 
6,586 (43.5) 

400 (2.6) 


62 (15.9) 
2,315 (29.8) 
5,080 (77.1) 
312 (78.0) 


328 (84.1) 
5,455 (70.2) 
1,506 (22.9) 
88 (22.0) 


3,507.49 


0.001 


Family type 
Living alone 
Couple 
With children 
Others 


3,484 (23.0) 
6,653 (43.9) 
4,256 (28.1) 
753 (5.0) 


925 (26.5) 
4,017 (60.4) 
2,432 (57.1) 

395 (52.5) 


2,559 (73.5) 
2,636 (39.6) 
1,824 (42.9) 
358 (47.5) 


1,132.29 


0.001 


Depression status 
Depression 
Normal 


4,423 (29.5) 
10,547 (70.5) 


1,051 (23.8) 
6,652 (63.1) 


3,372 (76.2) 
3,895 (36.9) 


1,927.58 


0.001 


Life satisfaction 


3.42 ± 0.45 


3.58 ± 0.37 


3.26 ± 0.46 


-38.08 


0.001 


Daily activity difficulty 
None 
Little 
Alot 


839 (6.8) 
7,483 (60.4) 
4,064 (32.8) 


561 (66.9) 
4,120 (55.1) 
1,052 (25.9) 


278 (33.1) 
3,363 (44.9) 
3,012 (74.1) 


1,054.79 


0.001 


Diagnosed disease 


2.00 ± 1.54 


1.55 ± 1.30 


2.47 ± 1.62 


46.38 


0.001 


Subjective economy state 
Poor 

Moderate 
Rich 

No answer 


6,949 (45.9) 
7,401 (48.9) 
755 (5.0) 
41 (0.3) 


2,464 (35.5) 
4,748 (64.2) 
543 (71.9) 
14 (34.1) 


4,485 (64.5) 
2,653 (35.8) 
212 (28.1) 
27 (65.9) 


1,320.78 


0.001 



Values are presented as mean ± standard deviation or number (%). 
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this Study, a reliable decision support model was designed 
that provides accurate information regarding the character- 
istics of the elderly with malnutrition. The algorithm used to 
construct the decision tree showed high accuracy, and it is 
expected to facilitate the discovery of discriminatory knowl- 
edge for the targeted problem. The C&R Tree, which was 
based on the C&R Tree method, provided excellent discrimi- 
nation of the characteristics associated with malnutrition in 
the elderly. This decision tree can be utilized to identify com- 
munity residing elderly individuals who are at high risk of 
malnutrition; this will eventually contribute to significantly 
reducing healthcare costs spent on treating malnutrition and 
its complications in the elderly. 
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